WWW access to the SYSTERS protein sequence cluster set
نویسندگان
چکیده
SUMMARY We present a Web server where the SYSTERS cluster set of the non-redundant protein database consisting of sequences from SWISS-PROT and PIR is being made available for querying and browsing. The cluster set can be searched with a new sequence using the SSMAL search tool. Additionally, a multiple alignment is generated for each cluster and annotated with domain information from the Pfam protein family database. AVAILABILITY The server address is http://www.dkfz-heidelberg.de/tbi/services/cluster/ systersform
منابع مشابه
The SYSTERS protein sequence cluster set
The SYSTERS (short for SYSTEmatic Re-Searching) protein sequence cluster set consists of the classification of all sequences from SWISS-PROT and PIR into disjoint protein family clusters and hierarchically into superfamily and subfamily clusters. The cluster set can be searched with a sequence using the SSMAL search tool or a traditional database search tool like BLAST or FASTA. Additionally a ...
متن کاملSYSTERS, GeneNest, SpliceNest: exploring sequence space from genome to protein
We have integrated the protein families from SYSTERS and the expressed sequence tag (EST) clusters from our database GeneNest with SpliceNest, a new database mapping EST contigs into genomic DNA. The SYSTERS protein sequence cluster set provides an automatically generated classification of all sequences of the SWISS-PROT, TrEMBL and PIR databases into disjoint protein family and superfamily clu...
متن کاملThe SYSTERS Protein Family Database in 2005
The SYSTERS project aims to provide a meaningful partitioning of the whole protein sequence space by a fully automatic procedure. A refined two-step algorithm assigns each protein to a family and a superfamily. The sequence data underlying SYSTERS release 4 now comprise several protein sequence databases derived from completely sequenced genomes (ENSEMBL, TAIR, SGD and GeneDB), in addition to t...
متن کاملThe SYSTERS Protein Family Web Server: Shortcut from large-scale sequence information to phylogenetic information SYSTERS superfamily 114462 comprises most of the Cation efflux domain proteins in Arabidopsis thaliana
With this poster [11], we present the SYSTERS protein family database, an attempt to classify all available protein sequences. In particular, we focus on the capability of the web interface to assist in in-depth analyses of special protein families. We demonstrate this by an analysis of a specific family of transmembraneous metal ion transport proteins characterised by the so called cation effl...
متن کاملThe SYSTERS protein family database: Taxon-related protein family size distributions and singleton frequencies
Based on the SYSTERS protein family database, we present taxon-related protein family frequencies and distributions. A set of taxon-related protein families is a subset of the whole family set with respect to one taxon, where taxon is not restricted to the species level but may be any rank in the taxonomy. We examine eight ranks in the lineages of seven organisms. A strong linear correlation is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 15 3 شماره
صفحات -
تاریخ انتشار 1999